Are Large Language Models More Empathetic than Humans?
With the emergence of large language models (LLMs), investigating whether they can surpass humans in areas such as emotion recognition and empathetic responding has become a focal point of research. This paper presents a comprehensive study exploring the empathetic responding capabilities of four state-of-the-art LLMs: GPT-4, LLaMA-2-70B-Chat, Gemini-1.0-Pro, and Mixtral-8x7B-Instruct in comparison to a human baseline. We engaged 1,000 participants in a between-subjects user study, assessing the empathetic quality of responses generated by humans and the four LLMs to 2,000 emotional dialogue prompts meticulously selected to cover a broad spectrum of 32 distinct positive and negative emotions. Our findings reveal a statistically significant superiority of the empathetic responding capability of LLMs over humans. GPT-4 emerged as the most empathetic, marking an approximately 31% increase in responses rated as "Good" compared to the human benchmark. It was followed by LLaMA-2, Mixtral-8x7B, and Gemini-Pro, which showed increases of approximately 24%, 21%, and 10% in "Good" ratings, respectively. We further analyzed the response ratings at a finer granularity and discovered that some LLMs are significantly better at responding to specific emotions compared to others. The suggested evaluation framework offers a scalable and adaptable approach for assessing the empathy of new LLMs, avoiding the need to replicate this study's findings in future research.
Use Cases of the Chi-Squared Test, Part 3 (Machine Learning)
Abstract: We propose a goodness-of-fit test for degree-corrected stochastic block models (DCSBM). The test is based on an adjusted chi-square statistic for measuring equality of means among groups of n multinomial distributions with d1,…,dn observations. In the context of network models, the number of multinomials, n, grows much faster than the number of observations, di, corresponding to the degree of node i, hence the setting deviates from classical asymptotics. We show that a simple adjustment allows the statistic to converge in distribution, under null, as long as the harmonic mean of {di} grows to infinity. When applied sequentially, the test can also be used to determine the number of communities.
Discovering Insights with Chi Square Tests
Let's dive into the world of chi-square tests and how to use them in Python with the scipy library. We'll be going over the chi-square goodness-of-fit test. Whether you are just starting out or an accomplished data analyst, this guide will equip you with practical examples and insights so you can confidently apply chi-square tests in your own work. This article was published as a part of the Data Science Blogathon. The chi-square test is one of the statistical procedures used to assess the relationship between two categorical variables.
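The goodness-of-fit test mentioned above takes only a few lines with scipy. A minimal sketch, using hypothetical die-roll counts (the numbers are illustrative, not from the article):

```python
import numpy as np
from scipy.stats import chisquare

# Hypothetical observed counts for 120 rolls of a six-sided die.
observed = np.array([22, 17, 19, 25, 16, 21])
# Under the null hypothesis of a fair die, each face is equally likely.
expected = np.full(6, observed.sum() / 6)  # 20 per face

stat, p_value = chisquare(f_obs=observed, f_exp=expected)
print(f"chi2 = {stat:.3f}, p = {p_value:.3f}")

# p is well above 0.05, so we fail to reject the fair-die hypothesis.
```

Note that `chisquare` requires the observed and expected counts to sum to the same total; the degrees of freedom default to (number of categories - 1).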
How to calculate p-value from chi-square statistic using Python? - The Security Buddy
In one of our previous articles, we discussed how to calculate the test statistic in a chi-square test of independence or goodness-of-fit test. We also discussed that the test statistic in a chi-square test follows the chi-square distribution. So, how can we calculate the p-value from the test statistic in a chi-square test? In this article, we will discuss that. Let's say our test statistic is 6.4, and the degrees of freedom is 5. Here, we are using the chi2.sf() function from scipy.stats, which gives the probability of observing a value at least as extreme as the test statistic under the null hypothesis.
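A minimal sketch of that calculation with scipy, using the statistic and degrees of freedom quoted above:

```python
from scipy.stats import chi2

stat = 6.4  # chi-square test statistic from the example above
df = 5      # degrees of freedom

# sf (survival function) is 1 - cdf: the right-tail probability,
# i.e. the p-value for a chi-square test.
p_value = chi2.sf(stat, df)
print(f"p-value = {p_value:.4f}")  # roughly 0.27: not significant at 0.05
```

Using `sf` is numerically preferable to `1 - chi2.cdf(...)` for very small tail probabilities, although the two agree here.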
How to get the Letter Frequency in Python
We will provide a walk-through example of how you can easily get the letter frequency in documents, considering either the whole document or only the unique words. We will work with the Moby Dick book and provide the frequency and the relative frequency of the letters. Finally, we will compare our observed relative frequencies with the letter frequency of the English language. From the resulting horizontal barplot, we can easily see that the letter e is the most common in both English texts and dictionaries; notice also that the distribution changes between texts and dictionaries.
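The counting step can be sketched with the standard library's Counter; here a short hypothetical snippet stands in for the full Moby Dick text:

```python
from collections import Counter
import string

# Hypothetical stand-in for the full book text loaded from a file.
text = "Call me Ishmael. Some years ago..."

# Keep only alphabetic characters, case-folded.
letters = [ch for ch in text.lower() if ch in string.ascii_lowercase]
counts = Counter(letters)
total = len(letters)

# Frequency and relative frequency, most common letters first.
for letter, n in counts.most_common(5):
    print(f"{letter}: {n} ({n / total:.1%})")
```

To count over unique words instead, replace `text.lower()` with `"".join(set(text.lower().split()))` or similar before filtering.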
Concept Drift Detection via Equal Intensity k-means Space Partitioning
Anjin Liu, Jie Lu, Guangquan Zhang
Data streams pose additional challenges to statistical classification tasks because the distributions of the training and target samples may differ as time passes. Such distribution change in streaming data is called concept drift. Numerous histogram-based distribution change detection methods have been proposed to detect drift. Most histograms are developed on grid-based or tree-based space partitioning algorithms, which makes the space partitions arbitrary and unexplainable and may cause drift blind spots. There is a need to improve the drift detection accuracy of histogram-based methods in the unsupervised setting. To address this problem, we propose a cluster-based histogram, called equal intensity k-means space partitioning (EI-kMeans). In addition, a heuristic method to improve the sensitivity of drift detection is introduced. The fundamental idea of improving the sensitivity is to minimize the risk of creating partitions in distribution offset regions. Pearson's chi-square test is used as the statistical hypothesis test so that the test statistic remains independent of the sample distribution. The number of bins and their shapes, which strongly influence the ability to detect drift, are determined dynamically from the sample based on an asymptotic constraint in the chi-square test. Accordingly, three algorithms are developed to implement concept drift detection, including a greedy centroids initialization algorithm, a cluster amplify-shrink algorithm, and a drift detection algorithm. For drift adaptation, we recommend retraining the learner if a drift is detected. The results of experiments on synthetic and real-world datasets demonstrate the advantages of EI-kMeans and show its efficacy in detecting concept drift.
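The histogram-plus-chi-square idea at the core of the abstract can be sketched at the library level (this is not EI-kMeans itself, which builds its partitions with k-means; the window sizes, bin count, shift magnitude, and +1 smoothing below are all illustrative assumptions):

```python
import numpy as np
from scipy.stats import chi2_contingency

rng = np.random.default_rng(0)

# Hypothetical stream: a reference window and an incoming window
# whose mean has shifted (i.e., concept drift has occurred).
reference = rng.normal(loc=0.0, scale=1.0, size=2000)
incoming = rng.normal(loc=0.8, scale=1.0, size=2000)

# Shared bin edges play the role of a simple grid-based space partition.
edges = np.histogram_bin_edges(reference, bins=10)
ref_counts, _ = np.histogram(reference, bins=edges)
new_counts, _ = np.histogram(incoming, bins=edges)

# Pearson's chi-square test on the 2 x k table of bin counts
# (+1 smoothing avoids empty bins breaking the expected counts).
table = np.vstack([ref_counts + 1, new_counts + 1])
stat, p_value, dof, expected = chi2_contingency(table)
print(f"chi2 = {stat:.1f}, p = {p_value:.3g}")  # tiny p => drift suspected
```

In a real detector this comparison would run repeatedly over sliding windows, with the learner retrained whenever the p-value falls below a chosen significance level.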
High-Dimensional Independence Testing and Maximum Marginal Correlation
The statistical hypothesis for testing independence is formulated as H0: F_XY = F_X F_Y versus HA: F_XY ≠ F_X F_Y. Traditional correlation measures like Pearson's correlation [13] are widely used but cannot detect nonlinear and high-dimensional dependence structures, whereas many recently proposed dependence measures are able to discover any dependence structure given a sufficiently large sample size. The most prominent pioneers are the distance correlation [22, 25] and the Hilbert-Schmidt independence criterion [4, 5]. They are shown to be asymptotically 0 if and only if the variables are independent, share similar formulations and properties [16, 19], and are valid and consistent for testing independence against any joint distribution at any fixed dimensionality. Other dependence measures were later proposed to improve the finite-sample testing power against strong nonlinear dependencies, such as the Heller-Heller-Gorfine method [6, 7] and the multiscale graph correlation [18, 26], among others. A dependence measure can be useful in plenty of statistical tasks, including two-sample testing [17], feature screening [10, 27, 30], time series [2, 12, 31], conditional independence [3, 24, 28], clustering [15, 21], graph testing [9, 29], etc.
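The limitation of Pearson's correlation noted above is easy to demonstrate. The sketch below implements the biased sample distance correlation directly from its double-centering definition (an illustrative implementation for 1-D data, not the cited papers' code) and contrasts it with Pearson's r on a purely nonlinear relationship:

```python
import numpy as np

def distance_correlation(x, y):
    """Biased sample distance correlation for 1-D arrays."""
    x = np.asarray(x, dtype=float)[:, None]
    y = np.asarray(y, dtype=float)[:, None]
    a = np.abs(x - x.T)  # pairwise distance matrices
    b = np.abs(y - y.T)
    # Double-center each distance matrix.
    A = a - a.mean(axis=0) - a.mean(axis=1)[:, None] + a.mean()
    B = b - b.mean(axis=0) - b.mean(axis=1)[:, None] + b.mean()
    dcov2 = (A * B).mean()
    dvar_x, dvar_y = (A * A).mean(), (B * B).mean()
    return np.sqrt(dcov2 / np.sqrt(dvar_x * dvar_y))

x = np.linspace(-1, 1, 200)
y = x ** 2  # deterministic but nonlinear dependence

pearson = np.corrcoef(x, y)[0, 1]
print(f"Pearson r = {pearson:.3f}")                 # near 0: misses it
print(f"dCor      = {distance_correlation(x, y):.3f}")  # clearly positive
```

Because x is symmetric about zero, cov(x, x²) vanishes and Pearson's r is essentially zero, yet the distance correlation is far from zero, flagging the dependence.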
What is a Chi-Square Test and How Does it Work?
"Science is advanced by proposing and testing a hypothesis, not by declaring questions unsolvable" – Nick Matzke Let's start with a case study. I want you to think of your favorite restaurant right now. Let's say you can predict a certain number of people arriving for lunch five days a week. At the end of the week, you observe that the expected footfall was different from the actual footfall. Sounds like a prime statistics problem?
Learn R for Applied Statistics - Programmer Books
Gain the fundamentals of the R programming language for doing the applied statistics useful for data exploration and analysis in data science and data mining. This book covers topics ranging from R syntax basics, descriptive statistics, and data visualizations to inferential statistics and regressions. After learning R's syntax, you will work through data visualizations such as histograms and boxplot charting, descriptive statistics, and inferential statistics such as the t-test, chi-square test, ANOVA, non-parametric tests, and linear regressions. Learn R for Applied Statistics is a timely skills-migration book that equips you with the R programming fundamentals and introduces you to applied statistics for data exploration. It is for those who are interested in data science, in particular data exploration using applied statistics, and the use of R programming for data visualizations.